Back

npj Genomic Medicine

Springer Science and Business Media LLC

Preprints posted in the last 7 days, ranked by how well they match npj Genomic Medicine's content profile, based on 33 papers previously published here. The average preprint has a 0.03% match score for this journal, so anything above that is already an above-average fit.

1
Evaluating splicing factor and kinase network crosstalk through global phosphoproteomics

Crowl, S.; Singh, S.; Zhang, T.; Naegle, K. M.

2026-04-21 systems biology 10.64898/2026.04.16.718710 medRxiv
Top 0.1%
3.6%
Show abstract

Both splicing and kinase signaling are biochemical processes that fundamentally determine and shape cell physiology. Although there has been some indication that there is an interaction between the two - splicing can alter the availability of exons encoding kinase targets and kinases can phosphorylate splicing factors - it has yet to be established the extent to which altering splicing factor expression impacts kinase signaling networks. In this work, we implemented a data-driven analysis using ENCODE RNA-sequencing data and prior work mapping post-translational modifications onto splice events to identify candidate splice factor perturbations that show extensive alterations to phosphorylation-encoding protein products. We then replicated the ENCODE knockdown experiments and performed global phosphoproteomics for two candidates, U2AF1 and SRSF3, complementing the transcription-level data. Both knockdowns showed extensive changes in phosphorylation and kinase activities, both basally and upon receptor tyrosine kinase stimulation. U2AF1 knockdown drove decreased JNK-associated cell death signaling but elevated chromosome regulation through CSNK2A1, PLK, and EIF2AK4 activity. SRSF3 knockdown, on the other hand, led to decreased cell cycle signaling through CDK and HIPK2 but increased cytoskeletal signaling through various PAKs. In addition, we found a striking enrichment of phosphorylated splicing regulators in both knockdowns that were linked to their splicing activity, such as HNRNPC, suggesting potential feedback and crosstalk between splice factors through signaling pathway activation. Importantly, comparison of differential phosphorylation measurements from this study to mRNA expression and splicing measurements from ENCODE revealed significant knockdown-dependent protein regulation, not captured by transcriptomic measurements alone, underscoring the value of phosphoproteomic profiling after splice factor perturbations. Combined, the transcriptomics and phosphoproteomics reveal deep interconnection between the two processes that are relevant to understanding cell signaling in health and disease.

2
Comparative fine-mapping of breast cancer susceptibility loci using summary statistics methods and multinomial regression

O'Mahony, D. G.; Beasley, J.; Zanti, M.; Dennis, J.; Dutta, D.; Kraft, P.; Kristensen, V.; Chenevix-Trench, G.; Easton, D. F.; Michailidou, K.

2026-04-22 epidemiology 10.64898/2026.04.21.26351364 medRxiv
Top 0.1%
3.6%
Show abstract

Summary statistics fine-mapping methods offer advantages over classical methods, including avoiding data-sharing constraints and improved modelling of correlated variables and sparse effects. However, its performance has not been comprehensively evaluated in breast cancer using real-world data. Previous multinomial stepwise regression (MNR) fine-mapping analyses for breast cancer identified 196 credible sets. Here, we apply summary statistics fine-mapping, compare methods, and assess parameters influencing performance. Using summary statistics from the Breast Cancer Association Consortium, we compared finiMOM, SuSiE, and FINEMAP to published MNR results across 129 regions. Performance was assessed by recall using in-sample and out-of-sample LD. Discordant credible sets were examined for technical factors, and target genes were defined using the INQUISIT pipeline. SuSiE showed the closest agreement with MNR. Results varied across regions depending on the assumed number of causal variants (L), with higher values reducing recall and no single L maximising performance. At optimal L per region, SuSiE identified 8,192 CCVs in 244 credible sets, with recall of 88%, 86%, and 72% for overall, ER-positive, and ER-negative breast cancer. Thirty MNR sets were missed. Discordance was partially explained by allele flips, imputation quality, and array heterogeneity. Fifty-two MNR-identified genes, including BRCA2, WNT7B and CREBBP were not recovered, while additional candidate genes were identified. Using out-of-sample LD reduced recall by 3% but identified novel variants. Fine-mapping results vary across methods, and no single approach is sufficient. The choice of L strongly influences results, and combining analytical approaches with functional validation can improve causal variant identification.

3
Duplication within 14q32.13 implicates a chimeric CLMN::SYNE3 RNA transcript in cerebellar ataxia

Litster, T. M.; Wilcox, R. A.; Carroll, R.; Gardner, A. E.; Nazri, N. M.; Shoubridge, C. A.; Delatycki, M. B.; Lohmann, K.; Agzarian, M.; Turella Divani, R.; Rafehi, H.; Scott, L.; Monahan, G.; Lamont, P. J.; Ashton, C.; Laing, N. G.; Ravenscroft, G.; Bahlo, M.; Haan, E.; Lockhart, P. J.; Friend, K. L.; Corbett, M. A.; Gecz, J.

2026-04-24 genetic and genomic medicine 10.64898/2026.04.23.26350376 medRxiv
Top 0.2%
3.0%
Show abstract

The spinocerebellar ataxias (SCAs) are a clinically heterogenous group of neurodegenerative disorders that affect movement, vision, speech and balance. Here, we reassign the linkage of SCA30 to 14q32.13 based on a cumulative LOD score >12. Within this interval we identified a 331 kb duplication, absent in population controls and not observed in >800 unrelated individuals with genetically unresolved cerebellar ataxia. RNASeq analysis of patient-derived lymphoblastoid cell lines revealed a splice-mediated chimeric transcript resulting from the duplication event. This transcript joined exon 1 of CLMN to exon 2 of SYNE3. In silico translation predicted that this chimeric transcript would produce a short N-terminal peptide corresponding to exon 1 of CLMN and the usually untranslated region of exon 2 of SYNE3 fused to the complete and in-frame SYNE3 protein. Transient overexpression of SYNE3 or the CLMN::SYNE3 fusion protein, in both HeLa cells and mouse primary cortical neurons, resulted in equivalent cellular outcomes including altered nuclear morphology and chromosomal DNA fragmentation. SYNE3 forms part of the linker of nucleoskeleton and cytoskeleton complex and is not usually expressed in cerebellar Purkyn[e] neurons while, CLMN has a Purkyn[e] specific expression pattern within the brain. Our data suggests that ectopic expression of SYNE3 in cerebellar Purkyn[e] neurons, mediated by the CLMN promoter, leads to cerebellar atrophy and causes spinocerebellar ataxia in the SCA30 family. This is an example of Mendelian disease arising from a novel, chimeric transcript with a likely dominant negative effect. Chimeric transcripts are commonly associated with cancers, but they are not often associated with monogenic disorders. Detection of chimeric transcripts as part of structural variant analysis could increase the genetic diagnostic yield of Mendelian disorders.

4
A long-read RNA sequencing and polysome profiling framework reveals transposable element-driven transcript diversity and translational rewiring in glioblastoma

Pizzagalli, M.; Sasipalli, S.; Leary, O.; Tran, L.; Haas, B.; Tapinos, N.

2026-04-21 cancer biology 10.64898/2026.04.18.719388 medRxiv
Top 0.3%
1.7%
Show abstract

BackgroundTransposable elements (TEs) account for over half of the human genome and are often derepressed in cancer. TEs can add cryptic splice sites, undergo exonization, and generate gene-TE fusion transcripts, but the combined effects of TEs on RNA processing and translation in glioblastoma stem cells (GSCs) remains incompletely elucidated. ResultsWe combined long-read RNA sequencing with polysome profiling in four patient-derived GSCs and two neural stem cell (NSC) controls to resolve TE-associated transcript diversity and its relationship to ribosomal engagement. Across GSCs, we identified 13,421 alternative splicing (AS) events, 3,077 of which contained TEs within 150 bp of splice junctions. AS sites proximal to TEs were associated with increased isoform switching compared to non-TE-associated AS sites (odds ratio 2.9 - 4.3). Moreover, AS isoforms generated from TE-proximal sites were more likely to exhibit altered ribosomal association (odds ratio 2.54). Directional shifts were observed, with shorter isoforms associating with monosome fractions and longer isoforms with polysome fractions. To enable systematic detection of gene - TE chimeric transcripts, we developed FuTER (Fusion TE Reporter), a long-read-based framework for identifying TE-associated fusions. Application to GSC datasets identified 78 GSC enriched fusion transcripts, several supported by breakpoint-spanning reads in polysome fractions, consistent with ribosome association. ConclusionsOur data suggest that TEs correlate with abnormal splicing activity and altered ribosome engagement in glioblastoma stem cells. By integrating long-read sequencing with polysome profiling and fusion detection, we establish a framework for analysis of TE-induced transcript diversity and its effects on cancer evolution and plasticity.

5
Determinants of DNA-sequence-based Diagnostic Yield in the CSER Consortium

Mavura, Y.; Crosslin, D.; Ferar, K. D.; Lawlor, J. M.; Greally, J. M.; Hindorff, L.; Jarvik, G. P.; Kalla, S.; Koenig, B. A.; Kvale, M.; Kwok, P.-Y.; Norton, M.; Plon, S. E.; Powell, B. C.; Slavotinek, A.; Thompson, M. L.; Popejoy, A. B.; Kenny, E. E.; Risch, N.

2026-04-22 genetic and genomic medicine 10.64898/2026.04.20.26351140 medRxiv
Top 0.4%
1.7%
Show abstract

PurposeDiagnostic yield from exome and genome sequencing varies widely across studies. It remains unclear how much of this variation reflects patient-level factors (e.g., sex, clinical features, race/ethnicity, genetic ancestry) versus site-level practices such as sequencing modality or variant interpretation workflows. We aimed to quantify the contributions of these factors to diagnostic outcomes across five U.S. clinical sequencing sites. MethodsWe performed a cross-sectional analysis of 3,008 prenatal, neonatal, and pediatric cases from the NHGRI Clinical Sequencing Evidence-Generating Research (CSER) consortium (2017-2023). Clinical indications spanned neurodevelopmental, neurological, immunological, metabolic, craniofacial, skeletal, cardiac, prenatal, and oncologic presentations. Genetic ancestry was inferred from sequencing data, and variants were interpreted using ACMG/AMP guidelines to classify DNA-based diagnoses. Generalized linear mixed models were used to estimate associations between diagnostic yield and fixed effects (sex, prenatal status, isolated cancer, number of clinical indications, sequencing modality, race/ethnicity, and genetic ancestry), while modeling study site as a random effect to quantify between-site variation. ResultsThe overall diagnostic yield was 19.0%. Multiple clinical indications (OR=1.47, 95% CI 1.20-1.80, p<0.001) were associated with higher diagnostic yield, and male sex (OR=0.80, 95% CI 0.66-0.96, p=0.017) and prenatal status (OR=0.63, 95% CI 0.44-0.90, p=0.012) were associated with lower yield. Sequencing modality, race/ethnicity, genetic ancestry, and isolated cancer were not statistically significantly associated with diagnostic outcomes.. A model without fixed effects attributed [~]10% of variance in diagnostic yield to between-site differences. After adjusting for covariates, site-level variance decreased to 5.7%, indicating consistent variation across sites not explained by measured patient factors. ConclusionAcross five sites, patient-level clinical features influenced diagnostic yield, but substantial site-level variation remained even after adjustment. Differences in variant interpretation, or case-classification practices may contribute to this residual variability. Further efforts to increase consistency in exome- and genome-sequencing diagnostic workflows may help reduce inter-site differences.

6
A rights-based intervention integrating social work and ophthalmic care for people experiencing or at risk of homelessness

Hassani, A.; Pecar, K.; Soliman, M.; Bunyon, P.; Ellinger, C.; Tulysewskid, G.; Croft, J.; Carillo, C.; Wewegama, G.; du Plessis-Schneider, S.; Estevez, J. J.

2026-04-24 public and global health 10.64898/2026.04.22.26351525 medRxiv
Top 0.5%
1.3%
Show abstract

Background Individuals experiencing or at risk of homelessness face substantial barriers to preventive eye care that are poorly addressed by standard service models. Interdisciplinary optometry-social work collaboration offers a rights-based approach to improving engagement and continuity of care. Methods A convergent mixed-methods study was conducted between February and August 2024 at a multidisciplinary community centre. Clients experiencing or at risk of homelessness received integrated optometry and social work assessment and were prioritised as high, medium, or low based on combined clinical and social risk. Social work follow-up was guided by the Triple Mandate and W-Questions framework. Quantitative data were summarised using mean (SD), median [IQR], or n (%). Qualitative case notes were analysed using content analysis with inductive coding and secondary review for consistency. Results A total of 165 clients had priority categories coded (high: 68; medium: 47; low: 154). Demographic data were available for 132 clients (60% male; mean age 49.5 years [SD 16]); 27% had not completed high school, 89% reported weekly income below AUD 1000, and 28% had vision impairment. Two hundred forty-five case-note entries were consolidated into 146 unique records. SMS (46%) and phone calls (38%) were the most documented contact methods, although only 21% of calls were answered; missed calls (13%) and disconnected numbers (7%) were common. Multi-modal contact was more frequently documented for higher-priority clients. Appointment assistance was the most recorded facilitator (71%), while rights-based supports, including interpreter and transport assistance, were infrequently documented (<=5%). Qualitative analysis identified unstable communication, reliance on informal supports, and service fragmentation as key influences on recall outcomes. Conclusion This study supports an interdisciplinary, rights-based optometry-social work model to address barriers to preventive eye care among people experiencing or at risk of homelessness. Embedding structured handovers and tiered recall processes within community-based services may strengthen continuity and accountability for high-priority clients. Future implementation should evaluate outcomes related to equity of reach, service integration, and sustained engagement in care.

7
PALM3 and hearing loss: a potential dual diagnosis interfering with novel gene discovery

Najarzadeh Torbati, P.; Hallbrucker, L.; Hofrichter, M. A. H.; Owrang, D.; Setzke, J.; Kilimann, M. W.; Hemmatpour, A.; Rajati, M.; Ghayoor Karimiani, E.; Haaf, T.; Vogl, C.; Vona, B.

2026-04-21 genetic and genomic medicine 10.64898/2026.04.20.26351093 medRxiv
Top 0.6%
1.2%
Show abstract

Hereditary hearing loss is highly genetically heterogeneous, with emerging overlap between genes implicated in early-onset and age-related hearing loss. We report a consanguineous family with autosomal recessive, non-syndromic hearing loss in which the proband harbors a homozygous splice-site variant in PALM3 (NM_001145028.2:c.314+1G>A) and a homozygous missense variant in OTOA. A minigene assay for the PALM3 variant demonstrated aberrant splicing with exon skipping, resulting in a frameshift and a large inframe deletion, both consistent with loss of function and impacting all known transcripts. While the organ of Corti from 12-month-old heterozygous Palm3 mice showed preserved overall architecture, published Palm3 knockout mice exhibit auditory dysfunction, supporting an auditory phenotype with loss of function. Although a dual molecular diagnosis cannot be excluded, the combined genetic, functional, and comparative data support PALM3 as a strong candidate gene for autosomal recessive hearing loss.

8
Novel Genetic Risk Loci for Pancreatic Ductal Adenocarcinoma Identified in a Genome-wide Study of African Ancestry Individuals

Vergara, C.; Ni, Z.; Zhong, J.; McKean, D.; Connelly, K. E.; Antwi, S. O.; Arslan, A. A.; Bracci, P. M.; Du, M.; Gallinger, S.; Genkinger, J.; Haiman, C. A.; Hassan, M.; Hung, R. J.; Huff, C.; Kooperberg, C.; Kastrinos, F.; LeMarchand, L.; Lee, W.; Lynch, S. M.; Moore, S. C.; Oberg, A. L.; Park, M. A.; Permuth, J. B.; Risch, H. A.; Scheet, P.; Schwartz, A.; Shu, X.-O.; Stolzenberg-Solomon, R. Z.; Wolpin, B. M.; Zheng, W.; Albanes, D.; Andreotti, G.; Bamlet, W. R.; Beane-Freeman, L.; Berndt, S. I.; Brennan, P.; Buring, J. E.; Cabrera-Castro, N.; Campa, D.; Canzian, F.; Chanock, S. J.; Chen, Y.;

2026-04-22 genetic and genomic medicine 10.64898/2026.04.21.26351329 medRxiv
Top 0.7%
0.9%
Show abstract

Pancreatic cancer disproportionately affects Black individuals in the United States, but they have limited representation in genetic studies of pancreatic ductal adenocarcinoma (PDAC). To address this gap, we performed admixture mapping and genome-wide association analysis (GWAS) in genetically inferred African ancestry individuals (1,030 cases and 889 controls). Admixture mapping identified three regions with a significantly higher proportion of African ancestry in cases compared to controls (5q33.3, 10p1, 22q12.3). GWAS identified a genome-wide significant association at 5p15.33 (CLPTM1L, rs383009:T>C, T Allele Frequency=0.51, OR:1.45, P value=1.24x10-8), a locus previously associated with PDAC. Known loci at 5p15.33, 7q32.3, 8q24.21 and 7q25.1 also replicated (P value <0.01). Multi-ancestral fine-mapping identified two potential causal SNPs (rs3830069 and rs2735940) at 5p15.33. Collectively these findings identified novel PDAC risk loci and expanded our understanding of this deadly cancer in underrepresented populations, emphasizing the multifactorial nature of PDAC risk including inherited genetic and non-genetic factors. Statement of SignificanceTo understand how genetic variation contributes to PDAC risk in Black people in North American, we studied individuals of genetically-inferred African ancestry. We identified novel risk loci and differences in the contribution of known loci. This demonstrates that ancestry-informed genetic analyses improve our understanding of PDAC risk and enhances discovery.

9
Loss of autism-associated gene wac alters social behavior and identifies cho-1 as a modulator of cholinergic signaling in C. elegans

Kim, D.-W.; Boonpraman, N.; Kuhn, N. C.; Sammi, S. R.

2026-04-21 neuroscience 10.64898/2026.04.17.719318 medRxiv
Top 0.8%
0.8%
Show abstract

WAC is an autism-associated gene involved in neurodevelopment. However, the effects of reduced WAC function on behavior and synaptic regulation in vivo remain unclear. Taking cues from the previous studies on the wac gene and the C. elegans model of ASD, we elucidated the effects of wac gene deletion on food-leaving behavior, a known parameter linked to ASD associated genes along with the cholinergic pathway. wac-deficient worms exhibited curtailed food-leaving behavior. Notably, observed phenotype was similar to that exhibited by nematodes with mutation in ASD related gene, neuroligin. In addition, wac-deficient worms showed impaired growth, reduced pharyngeal pumping, and lifespan. To examine potential synaptic mechanisms, we analyzed expression of genes related to cholinergic signaling across all developmental stages (L1-L4) through young adult (YA). Stage-specific transcriptional changes were observed, with increased expression of ace-1 and acr-3 at L1, acr-3 at L3, and acr-3, cha-1, lev-1, and lev-10 at L4. The transcriptomic alteration was most prominent at YA stage, exhibiting upregulation of ace-1, cha-1, cho-1, lev-1, lev-10, unc-17, unc-29, unc-38, and unc-50. To identify specific suppressor of upmodulated Ach signaling, RNAi of the upregulated genes was performed. cho-1 was identified as a specific suppressor of elevated Ach signaling. cho-1 encodes a high-affinity choline transporter responsible for choline uptake in the pre-synapse. These studies identify the molecular mechanisms pertaining to up-modulation of cholinergic signaling in wac mutant worms. O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=112 SRC="FIGDIR/small/719318v1_ufig1.gif" ALT="Figure 1"> View larger version (24K): org.highwire.dtl.DTLVardef@1bdf8a9org.highwire.dtl.DTLVardef@1104825org.highwire.dtl.DTLVardef@1f09682org.highwire.dtl.DTLVardef@293b08_HPS_FORMAT_FIGEXP M_FIG C_FIG

10
Transcriptome-Wide Alternative Splicing Analysis Implicates Complex Events in Bipolar Disorder

Martinez-Jimenez, M.; Garcia-Ortiz, I.; Romero-Miguel, D.; Kavanagh, T.; Marshall, L. L.; Bello Sousa, R. A.; Sanchez Alonso, S.; Alvarez Garcia, R.; Benavente Lopez, S.; Di Stasio, E.; Schofield, P. R.; Baca-Garcia, E.; Mitchell, P. B.; Cooper, A. A.; Fullerton, J. M.; Toma, C.

2026-04-21 genetic and genomic medicine 10.64898/2026.04.19.26351209 medRxiv
Top 0.9%
0.7%
Show abstract

Alternative-splicing events (ASE) increase transcriptomic variability and play key roles in biological functions. The contribution of ASE to bipolar disorder (BD) remains largely unexplored. We performed a Transcriptome-Wide Alternative-Splicing Analysis (TWASA) to identify ASEs and genes potentially involved in BD. The study comprised 635 individuals: a discovery sample (DS) of 31 individuals from eight multiplex BD families (16 BD cases; 15 unaffected relatives), and a replication sample (RS) of 604 subjects (372 BD cases; 232 controls). Sequencing was conducted on RNA from lymphoblastoid cell lines (DS) and whole blood (RS). TWASA was performed using VAST-TOOLS (VT), rMATS (RM), and MAJIQ/MOCCASIN (MCC). Gene-set association analyses of genes containing ASEs were performed across six psychiatric disorders. Novel ASE (nASE) were investigated in the DS using FRASER. Limited gene overlap was observed across TWASA tools. MCC identified 2,031 complex ASEs involving 1,508 genes, showing the strongest genetic association with BD across psychiatric phenotypes. Prioritization of MCC-identified ASE genes yielded 441 candidates, including DOCK2 as top candidate from the DS. Replication was obtained for 98 genes, five with an identical ASE, and four (RBM26, QKI, ANKRD36, and TATDN2) showing a concordant percentage-spliced-in direction with the DS. Finally, 578 nASE were identified in the DS, with no evidence of familial segregation or differences in ASE types. This first TWASA in BD reveals tool-specific variability, complex ASE for genes specifically associated with BD, and novel candidate genes for BD. Alternative transcript isoform abundance may represent a mechanism contributing to BD pathophysiology.

11
THRB splice site variants lead to exon 4 skipping and TRβ1 gain-of-function syndrome

Hones, G. S.; Liao, X.-H.; Mahler, E. A.; Herrmann, P.; Eckstein, A.; Fuhrer, D.; Castillo, J. M.; Chiang, J.; Vincent, A. L.; Weiss, R. E.; Dumitrescu, A. M.; Refetoff, S.; Moeller, L. C.

2026-04-22 endocrinology 10.64898/2026.04.15.26349265 medRxiv
Top 0.9%
0.7%
Show abstract

BackgroundHeterozygous c.283+1G>A and c.283G>A variants in the THRB gene, encoding for thyroid hormone receptor (TR){beta}1 and {beta}2, lead to autosomal dominant macular dystrophy (ADMD). We report the detailed clinical characterization of two first-degree relatives with ADMD, heterozygous for THRB c.283+1G>A, and an unrelated ADMD patient with a novel variant, c.283G>C. The genomic and molecular consequences of both variants were studied. MethodsgDNA and mRNA were obtained from leukocytes. Clinical characterization included biochemistry, bone density and body composition, ECG, echocardiography, ultrasound, audiometry and color-vision. In vitro assays investigated TR function and DNA binding. ResultsThe patients manifested no resistance to thyroid hormone beta (RTH{beta}) and had normal FT4 and TSH. Detailed studies in two patients showed no goiter, tachycardia, hypercholesterinemia or hepatic steatosis. Hearing was not impaired. Both had impaired color vision and reduced bone density. RT-PCR from all three patients revealed skipping of exon 4 exclusive to TR{beta}1, producing a deletion of 87 amino acids in the N-terminal domain (TR{beta}1{Delta}NTD). In vitro, DNA-binding affinity of TR{beta}1{Delta}NTD to DR4-TRE with or without RXR was comparable to TR{beta}1WT. Surprisingly, TR{beta}1{Delta}NTD was transcriptionally twice more active than TR{beta}1WT with a similar EC50 for T3, demonstrating gain-of-function of TR{beta}1{Delta}NTD. THRA expression in leukocytes was increased by 3-fold compared to unrelated controls and different from RTH{beta} patients. ConclusionThese THRB splice site variants produce TR{beta}1 exon 4 skipping, resulting in a gain-of-function mutant, TR{beta}1{Delta}NTD. This explains the dominant ADMD phenotype devoid of RTH{beta} and suggests a TR{beta}1 gain-of-function syndrome.

12
Subtypes of Internalizing and Externalizing Problems in Autistic Preschool Children: Participation in Daily Life and Family Outcomes

Nakamura, T.; Koshio, I.; Nagayama, H.

2026-04-21 psychiatry and clinical psychology 10.64898/2026.04.14.26350723 medRxiv
Top 0.9%
0.7%
Show abstract

AimAutistic children have a high but varied prevalence of internalizing and externalizing problems. This study aimed to identify the subtypes of internalizing and externalizing problems among autistic preschool children in Japan, examine their temporal stability, and investigate differences in participation in daily life and family outcomes across these subtypes. MethodsA prospective cohort study was conducted with 275 caregivers of autistic children aged 51-75 months. Internalizing and externalizing problems were assessed using the Strengths and Difficulties Questionnaire. ResultsLatent transition analysis identified five subtypes: Low-symptom, High-emotional, Externalizing, Comorbid, and Peer-difficulty groups. Membership in the High-emotional and Externalizing groups was relatively stable over time, whereas the Peer-difficulty group showed frequent transitions to subtypes with higher levels of internalizing or externalizing problems. Significant differences in participation in daily life and family outcomes were observed across subtypes, but these patterns were inconsistent with a simple gradient of symptom levels. ConclusionsThe novel findings that the temporal stability of subtype membership varied and that differences in participation in daily life and family outcomes were observed across the subtypes suggest that the heterogeneity of internalizing and externalizing problems may be associated with variations in childrens participation in daily life and family outcomes over time. Plain Language SummaryAutistic preschool children often experience emotional and behavioral difficulties, but the way these difficulties manifest varies widely across individuals. This study aimed to identify the patterns of these difficulties, examine how they change over time, and investigate how participation in daily life and family outcomes differ across autistic preschool children. We conducted a study with 275 caregivers of autistic children aged 4-6 years in Japan. From caregiver reports of childrens emotional and behavioral difficulties, five distinct patterns were identified: a group with mainly emotional difficulties, a group with mainly behavioral difficulties, a group with both types of difficulties, a group with relatively low levels of difficulties, and a group characterized primarily by peer-related difficulties. Our findings suggest that different patterns of emotional and behavioral difficulties are associated with differences in childrens participation in daily life and family outcomes. These differences could not be explained simply by the overall severity of difficulties but rather reflect distinct patterns based on the type of difficulty. The results indicate that autistic children face diverse difficulties that change over time.

13
Cross-ancestry evaluation of idiopathic pulmonary fibrosis genetic risk variants

Nabunje, R.; Guillen-Guio, B.; Hernandez-Beeftink, T.; Joof, E.; Leavy, O. C.; International IPF Genetics Consortium, ; Maher, T. M.; Molyneux, P.; Noth, I.; Urrutia, A.; Aburto, M.; Flores, C.; Jenkins, R. G.; Wain, L. V.; Allen, R. J.

2026-04-25 genetic and genomic medicine 10.64898/2026.04.17.26349970 medRxiv
Top 1%
0.7%
Show abstract

Genome-wide association studies of idiopathic pulmonary fibrosis (IPF) have identified 35 common genetic risk loci associated with IPF susceptibility. In this study, we evaluated the effects of the reported variants in clinically curated non-European individuals. Despite limited sample sizes, we observed partial replication, limited transferability of some variants and evidence of ancestry-specific effects. The MUC5B promoter variant rs35705950 emerged as the dominant and most consistent signal across ancestries. Our findings highlight the need for larger, well-characterised studies in understudied populations to support robust discovery and translation.

14
Racioethnic Disparities in Risk of Cardiometabolic Risk Factors and Cardiovascular Disease among Women Treated for Breast Cancer: The Pathways Heart Study

Yao, S.; Zimbalist, A.; Sheng, H.; Fiorica, P.; Cheng, R.; Medicino, L.; Omilian, A.; Zhu, Q.; Roh, J.; Laurent, C.; Lee, V.; Ergas, I.; Iribarren, C.; Rana, J.; Nguyen-Huynh, M.; Rillamas-Sun, E.; Hershman, D.; Ambrosone, C.; Kushi, L.; Greenlee, H.; Kwan, M.

2026-04-24 epidemiology 10.64898/2026.04.23.26351612 medRxiv
Top 1%
0.5%
Show abstract

Background: Few studies have examined racioethnic disparities in cardiovascular disease (CVD) in women after breast cancer treatment, who are at higher risk due to cardiotoxic cancer treatment. Methods: Based on the Pathways Heart Study of women with a history of breast cancer, this analysis examines the association between cardiometabolic risk factors (hypertension, diabetes, and dyslipidemia) and CVD events with self-reported race and ethnicity, as well as genetic similarity. Multivariable logistic and Cox proportional hazards regression models were used to test race and ethnicity and genetic similarity with prevalent and incident cardiometabolic risk factors and CVD events. Results: Of the 4,071 patients in this analysis, non-Hispanic Black (NHB), Asian, and Hispanic women were more likely to have prevalent and incident diabetes than non-Hispanic White (NHW) women. Analysis of genetic similarity revealed results consistent with self-reported race and ethnicity. For CVD risk, NHB women were more likely to develop heart failure and cardiomyopathy than NHW women. In contrast, Hispanic women were at lower risk of any incident CVD, serious CVD, arrhythmia, heart failure or cardiomyopathy, and ischemic heart disease, which was consistent with the associations found with Native American ancestry. Conclusions: This is the largest multi-ethnic study of disparities in CVD health in breast cancer survivors, demonstrating corroborating findings between self-reported race and ethnicity and genetic similarity. The results highlight disparities in cardiometabolic risk factors and CVD among breast cancer survivors that warrant more research and clinical attention in these distinct, high-risk populations.

15
Zebrafish Functional Screening of FDA-Approved Drugs for Autosomal Dominant Retinitis Pigmentosa Caused by RHODOPSIN Q344X Mutation

Wang, B.; Ganzen, L.; Coskun, E.; James, R.; Kha, T.; Zhu, X.; New, J. A.; Tsujikawa, M.; Leung, Y. F.

2026-04-21 neuroscience 10.64898/2026.04.18.719270 medRxiv
Top 1%
0.4%
Show abstract

Retinitis Pigmentosa (RP) is a group of inherited retinal degenerations for which most subtypes lack effective drug treatments. This challenge is particularly critical for autosomal dominant (ad) RP, which is often unsuitable for gene replacement therapy. To address this challenge, we screened an FDA-approved compound library using a zebrafish adRP model expressing a human RHODOPSIN transgene with the Q344X mutation. The screen evaluated drug effects on larval visual behavior by assessing the visual-motor response (VMR). Four compounds significantly improved VMR in Q344X zebrafish: amitriptyline, difluprednate, maprotiline, and prednisolone. Further characterization revealed that these hits act through distinct mechanisms, including reducing rod death, promoting rod neogenesis, and enhancing the function of extraocular photoreceptors. Together, these findings demonstrate the potential to repurpose these drugs for adRP caused by the RHO Q344X mutation, providing preclinical candidates and revealing potential targets for future drug development.

16
Transcriptomic analysis of organotypic porcine retina cultures

khosravi, s.; Giorgio, G.; Staurenghi, F.; schoenberger, t.; Gross, P.; Ried, M.; Frankenhauser, J.; Eder, S.; Markert, E.; Bakker, R.; Babaei, S.; Zippel, N.

2026-04-21 molecular biology 10.64898/2026.04.16.718959 medRxiv
Top 2%
0.4%
Show abstract

Porcine organotypic retinal explant cultures are widely used to study retinal neurodegeneration under controlled conditions, but the biological process that occurs in the retinal explant over time due to preparation-induced injury and culture are not well understood. Here, we generated a time-resolved transcriptomic reference for porcine neural retinal explants-maintained ex vivo for 10 days. Global expression profiles are strongly separated by culture time, with Day 0 clearly distinct from cultured samples and at Day 7 and Day 10 showing the highest similarity, indicating a transition toward a later stabilized state. Across the time course, 3,187 genes were differentially expressed relative to Day 0, with the largest shifts occurring at an early stage of culture (Day 1-Day 3). Pathway-level analyses revealed coordinated remodeling involving inflammatory signaling, and metabolic/bioenergetic changes, including reduced mitochondrial and oxidative phosphorylation-related programs at later time points. Here, we provide a time-resolved transcriptomics reference dataset for cultured porcine retinal explants. These data can build a foundation to interpret data generated in this model, differentiate changes inherent to the explant culture from treatment-specific effects and to select appropriate experimental windows for mechanistic studies of retinal degeneration.

17
Genetic and Proteomic Investigation of the Smoking-Parkinson Disease Association

Shi, M.; Gunawan, T.; Setzer, M.; Okashah, N.; Liu, Y.; Wingo, T. S.; Wingo, A. P.; Weintraub, D.; Schwarzschild, M. A.; Rentsch, C. T.; Kranzler, H. R.; Gray, J. C.

2026-04-20 genetic and genomic medicine 10.64898/2026.04.17.26351138 medRxiv
Top 2%
0.4%
Show abstract

BackgroundEpidemiological studies show an inverse association between cigarette smoking and Parkinsons disease (PD), suggesting a potential protective effect of smoking on PD incidence, despite the well-established and overwhelming harms of smoking to human health. We integrated genomic and proteomic approaches to investigate the causality and molecular basis of this potential relationship. MethodsWe analyzed summary statistics from genome-wide association studies (GWAS) of smoking initiation (SmkInit), smoking intensity, and PD. Two-sample Mendelian randomization (MR) tested whether genetic liability to smoking behaviors causally influences PD risk. Shared genomic architecture was quantified using MiXeR, and conjunctional false discovery rate (conjFDR) analysis identified loci jointly associated with smoking and PD, which were then mapped to genes and tested for tissue enrichment. To identify mediating proteins, we integrated dorsolateral prefrontal cortex proteomic data with GWAS using proteome-wide association studies (PWAS), summary-based MR, heterogeneity in dependent instruments testing, and colocalization. Finally, the druggability of convergent genes was evaluated. ResultsMR analyses indicated a protective effect of genetic liability to SmkInit on PD risk (OR = 0.78, 95% CI: 0.67-0.91, P = 1.5 x 10-3), which was consistent across sensitivity analyses and not suggestive of directional pleiotropy. However, no significant effect of genetic liability to cigarettes per day (CigDay) on PD risk was found. MiXeR revealed modest polygenic overlap between SmkInit and PD (13.9%; genetic correlation rg = -0.16) and between CigDay and PD (22.9%; rg = -0.09). ConjFDR identified 95 shared loci for SmkInit-PD and 26 for CigDay-PD. SmkInit-PD loci mapped to genes involved in neurotrophic signaling, synaptic organization, microglial modulation, and mitochondrial stress responses, with expression enriched in substantia nigra, basal ganglia, and interconnected cortical regions. PWAS identified 11 proteins shared by PD and SmkInit and 5 shared with CigDay, several of which (AKT3, MAPT, RIT2, EXD2, and PPP3CC) were supported by both genomic and proteomic analyses. Druggability assessment highlighted six proteins with existing pharmacologic modulation potential, spanning neurotrophic, microglial, proteostatic, and ion-channel pathways. ConclusionsGenetic liability to smoking initiation appears to confer modest protection against PD. Integrative genomic and proteomic evidence converges on neurotrophic, synaptic, microglial, and mitochondrial pathways as shared mechanisms, identifying biologically coherent potential therapeutic targets for advancing smoke-free neuroprotective strategies in PD.

18
Uncertainty-Gated Glaucoma Screening: Combining Semi-Supervised Classification with Multi-Agent Large Language Model Deliberation

Garimella Narasimha, S. V.; Brown, N.; Sridhar, S.

2026-04-20 ophthalmology 10.64898/2026.04.17.26351127 medRxiv
Top 2%
0.4%
Show abstract

Automated glaucoma screening from optical coherence tomography (OCT) faces two persistent challenges: scarcity of expert-labeled data and unreliable model predictions on diagnostically ambiguous cases. We present a two-tier diagnostic pipeline that addresses both. In the first tier, an EfficientNetV2-S classifier trained under a semi-supervised pseudo supervisor framework achieves 0.84 AUC on 150 held-out test patients from the Harvard Glaucoma Detection and Progression dataset, using only 350 labeled training samples out of 700. In the second tier, 124 flagged cases are routed to a multi-agent system built on MedGemma 4B, where three specialist agents deliberate over three rounds before rendering a final diagnosis. On these flagged cases, the agent system achieves 100% sensitivity--detecting all 55 glaucoma cases with zero missed diagnoses--and 89.5% overall accuracy (111/124), compared to the classifiers 73.4% (91/124). Uncertainty analysis confirms that the classifiers output probability reliably separates confident predictions (96.3% accuracy, n = 27) from uncertain ones (74.0%, n = 123), producing a 22-percentage-point gap that serves as a triage signal. The agents fix 32 cases the classifier misclassifies while introducing 12 new errors, yielding a net improvement of 20 cases. These results are from a single training run without variance estimates and should be interpreted as preliminary evidence that uncertainty-gated routing to vision-language model agents can meaningfully improve diagnostic accuracy on the cases where automated classifiers are least reliable.

19
Investigating Uptake and Impact of Genetic and Genomic Evaluation Following a Perinatal Demise

Mossler, K.; D'Orazio, E.; Hall, K.; Osann, K.; Kimonis, V.; Quintero-Rivera, F.

2026-04-23 genetic and genomic medicine 10.64898/2026.04.22.26347546 medRxiv
Top 2%
0.4%
Show abstract

Objective The decline of the perinatal demise rate is slowing and demises are often unexplained. Significant research has been done regarding diagnostic yield and genetic causes of demise, but little is known about how Geneticist involvement impacts outcomes. The goal of the study was to evaluate post-mortem genetic testing practices and effects of the geneticists involvement. Methods Retrospective data from 111 perinatal demise cases was examined, including rates of prenatal genetic counseling, post-delivery genetics consult, genetic testing, and autopsy investigation. Results In this cohort 54% received genetic testing and 25% received a genetics consult. When compared to those without, cases with genetic specialist involvement were associated with significant increases in testing uptake (p=0.007), diagnostic yield (p<0.001), and patient education (p<0.001). Second trimester stillbirths and those with fewer ultrasound (US) abnormalities were less likely to receive genetic testing (both p values <0.001) and consults (p<0.001, p=0.020). Conclusion Though it was not possible to avoid ascertainment bias, this data demonstrates that geneticist involvement correlates with a higher rate of testing, greater diagnostic yield, and more thorough counseling. These findings underscore the importance of integrating genetics providers into perinatal postmortem healthcare teams.

20
Onca: An Open 9B Language Model for Pancreatic Cancer Clinical Tasks

Shim, K. B.

2026-04-24 oncology 10.64898/2026.04.16.26351055 medRxiv
Top 2%
0.4%
Show abstract

Pancreatic ductal adenocarcinoma (PDAC) remains one of the deadliest solid tumors and continues to face low treatment-trial participation, fragmented evidence workflows, and labor-intensive ab- straction of unstructured clinical text. Existing oncology-focused language models show promise, but many depend on private institutional corpora, limiting reproducibility and practical reuse across centers. We present Onca, an open 9B dense model designed for four PDAC-relevant tasks: trial eligibility screening, case-specific clinical reasoning, structured pathology report extraction, and molecular variant evidence reasoning. Onca is fine-tuned from Qwopus3.5-9B-v3 with a single Un- sloth BF16 LoRA adapter on 37,364 training rows drawn from openly available sources. The evalu- ation spans 11 panels and compares Onca against Woollie-7B, CancerLLM-7B, OpenBioLLM-8B, and the unmodified Qwopus base. Onca achieves the strongest overall results on Trial Screening (81.6 F1), Clinical Reasoning (14.1 composite), Pathology Extraction (30.5 field exact-match), Pub- MedQA Cancer (68.3 macro-F1), and PubMedQA (66.5 macro-F1). The strongest gains appear in tasks closest to routine oncology workflow, especially trial review and pathology structuring. These findings suggest that clinically targeted pancreatic-cancer language models can be built from open data with competitive performance while remaining practical to train on a single workstation-scale GPU setup.